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1 IDS, content filtering, Java, etc.; A network intrusion detection system b as ed on the Q 

artificial neural networks 
Wang Jing-xin, Wang Zhi-ying, Dai Kui 

November 2004 Proceedings of the 3rd international conference on Information 
security InfoSecu v 04 

Publisher: ACM Press 

Full text available: fiQ pdf(371 .41 KB) Additional Information: full citation , abstract , references , index terms 



To address the problem of high false alarm rate confronted by the traditional intrusion 
detection systems, this paper presents a new method of applying the artificial neural 
networks to the network intrusion detection system. We designed and implemented a 
network intrusion detection system based on the artificial neural networks, and then 
several experiments have been carried out. For the known intrusions, the false alarm rate 
is less than 3 percent, and, for the unknown intrusions, the false al ... 



Keywords: BP classifier, artificial neural networks, false negative rate, false positive rate, 
intrusion detection system 



2 Minimax Regret Classifier for Imprecise Class Distributions Q 
Rocio Alaiz-Rodriguez, Alicia Guerrero-Curieses, Jesus Cid-Sueiro 

May 2007 The Journal of Machine Learning Research; volume 8 

Publisher: MIT Press 

Full text available: Q pdf(364.38 KB) Additional Information: full citation , abstract 

The design of a minimum risk classifier based on data usually stems from the stationarity 
assumption that the conditions during training and test are the same: the misclassification 
costs assumed during training must be in agreement with real costs, and the same 
statistical process must have generated both training and test data. Unfortunately, in real 
world applications, these assumptions may not hold. This paper deals with the problem of 
training a classifier when prior probabilities cannot ... 

3 A comparison of classifiers and document representations for the routing problem Q 
Hinrich Schutze, David A. Hull, Jan O. Pedersen 

July 1995 Proceedings of the 18th annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '95 

Publisher: ACM Press 

Full text available: fQpdf (1,16 MB) Additional Information: full citation, references, citings, index terms 
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.Bernhard E. Boser, Isabelle M. Guyon, Vladimir N. Vapnik 

July 1992 Proceedings of the fifth annual workshop on Computational learning 
theory COLT '92 

Publisher: ACM Press 

Full text available - fS) pdfd 00 MB) Additional Information: full citation , abstract , references , citings , index 

' terms 

A training algorithm that maximizes the margin between the training patterns and the 
decision boundary is presented. The technique is applicable to a wide variety of the 
classification functions, including Perceptrons, polynomials, and Radial Basis Functions. 
The effective number of parameters is adjusted automatically to match the complexity of 
the problem. The solution is expressed as a linear combination of supporting patterns. 
These are the subset of training patterns that are closest t ... 

NeurAlign: combining word alignments using neural networks | 
Necip Fazil Ayan, Bonnie J. Dorr, Christof Monz 

October 2005 Proceedings of the conference on Human Language Technology and 
Empirical Methods in Natural Language Processing HLT '05 

Publisher: Association for Computational Linguistics 

Full text available: ^ pdfd 53. 52 KB ) Additional Information: full citation , abstract, references 

This paper presents a novel approach to combining different word alignments. We view 
word alignment as a pattern classification problem, where alignment combination is 
treated as a classifier ensemble, and alignment links are adorned with linguistic features. 
A neural network model is used to learn word alignments from the individual alignment 
systems. We show that our alignment combination approach yields a significant 20--34% 
relative error reduction over the best-known alignment combination t ... 

Effect of data compression of ERP sign preprocessed by FWT algorithm upon a ' 
neural network classifier 

S. DasGupta, M. Hohenberger, Len Trejo, T. Kaylani 

April 1990 ACM SIGSIM Simulation Digest , Proceedings of the 23rd annual 

symposium on Simulation ANSS '90, volume 20 issue 4 
Publisher: IEEE Press, ACM Press 

Full text available: ^| pdf(648.52 KB) Additional Information: full citation , abstract , references , index terms 

Earlier research at the Navy Personnel Research and Development Center revealed that 
measures of the brain response to sensory stimuli, known as Event Related Potentials 
(ERP) may be used to assess unique process-related variance that is dependent upon 
human performance. For example, it was found that the sensitivity of individual subjects 
to dynamic color contrast in computer displays can be assessed by visual ERP's. It has also 
been observed that RMS measures of the P1-N1-P2 complex and t ... 



7 A novel method for protein subcellular localization based on boosting and 

probabilistic neural network 
Jian Guo, Yuanlie Lin, Zhirong Sun 

January 2004 Proceedings of the second conference on Asia-Pacific bioinformatics - 
Volume 29 APBC 04 

Publisher: Australian Computer Society, Inc. 

Full text available: *Q pdf( 106,36 KB ) Additional Information: full citation, abstract, references, i ndex terms 

Subcellular localization is a key functional characteristic of proteins. An automatic, reliable 
and efficient prediction system for protein subcellular localization is needed for large-scale 
genome analysis. In this paper, we introduce a novel subcellular prediction method 
combining boosting algorithm with probabilistic neural network algorithm. This new 
approach provided superior prediction performance compared with existing methods. The 
total prediction accuracy on Reinhardt and Hubbard's data ... 

Keywords: amino acid composition, boosting, probabilistic neural network, subcellular 
localization 
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8 Voting with a parameterized veto strategy: solving the KDD Cup 2006 problem by Q 
means of a classifier commi t tee 

Domonkos Tikk, Zsolt T. Kardkovacs, Ferenc P. Szidarovszky 
December 2006 ACM SIGKDD Explorations Newsletter Volume 8 issue 2 
Publisher: ACM Press 

Full text available: 'gj pdf(612.54 KB) Additional Information: f u ll cit a tio n , abstract, referen c es 

This paper presents our winner solution for the KDD Cup 2006 problem. It is based on the 
results of three different supervised learning techniques which are then combined in a 
classifier committee, and finally a single solution is obtained with a voting procedure. The 
voting procedure assigns weights to each member of the committee according to their 
average performance on a ten-fold cross-validation test and it also takes into account the 
confidence values returned by the three algorithms. T ... 

9 S pecial Issue on learnin g theory: On the rate of convergence of re g ularized boostin g Q 
classifiers 

Gilles Blanchard, G'bor Lugosi, Nicolas Vayatis 

December 2003 The Journal of Machine Learning Research, volume 4 
Publisher: MIT Press 

Full text available: Q pdf(302.45 KB) Additional Information: full citation , abstract , citings , index terms 

A regularized boosting method is introduced, for which regularization is obtained through a 
penalization function. It is shown through oracle inequalities that this method is model 
adaptive. The rate of convergence of the probability of misclassification is investigated. It 
is shown that for quite a large class of distributions, the probability of error converges to 
the Bayes risk at a rate faster than n- (v+2 > /(4(v+1)) where V is the VC dimension of the "ba ... 

10 Fast Kernel Classifiers with Online and Active Learning Q 
Antoine Bordes, Seyda Ertekin, Jason Weston, Leon Bottou 

December 2005 The Journal of Machine Learning Research, volume 6 

Publisher: MIT Press 

Full text available: ^ pdf(577.37 KB) Additional Information: full citation , abstract 

Very high dimensional learning systems become theoretically possible when training 
examples are abundant. The computing cost then becomes the limiting factor. Any 
efficient learning algorithm should at least take a brief look at each example. But should 
all examples be given equal attention?This contribution proposes an empirical answer. We 
first present an online SVM algorithm based on this premise. LASVM yields competitive 
misclassification rates after a single pass over the training examples, ... 

11 Finknn: a fuzzy interval number k-nearest neighbor classifier for prediction of sugar Q 
production from p o pulations of samples 

Vassilios Petridis, Vassilis G. Kaburlasos 

December 2003 The Journal of Machine Learning Research, Volume 4 
Publisher: MIT Press 

Full text available- f fl pdf(360,76 KB) Addltional ,nformation: MUMon, abstract, references , citings, index 
yLjr ^ terms 

This work introduces FINkNN, a k-nearest-neighbor classifier operating over the metric 
lattice of conventional interval-supported convex fuzzy sets. We show that for problems 
involving populations of measurements, data can be represented by fuzzy interval 
numbers (FINs) and we present an algorithm for constructing FINs from such populations. 
We then present a lattice-theoretic metric distance between FINs with arbitrary-shaped 
membership functions, which forms the basis for FINkNN 1 ... 

12 Multimedia data mining: Automatic classification of speech and music using neural Q 
^ networks 

^ M. Kashif Saeed Khan, Wasfi G. Al-Khatib, Muhammad Moinuddin 
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November 2004 Proceedings of the 2nd ACM international workshop on Multimedia 
databases MMDB '04 

Publisher: ACM Press 

Full text available: *g] pdf(1.67 MB) Additional Information: full citation , abstract , references , index terms 

The importance of automatic discrimination between speech signals and music signals has 
evolved as a research topic over recent years. The need to classify audio into categories 
such as speech or music is an important aspect of many multimedia document retrieval 
systems. Several approaches have been previously used to discriminate between speech 
and music data. In this paper, we propose the use of the mean and variance of the 
discrete wavelet transform in addition to other features that have ... 

Keywords: audio features, audio signal processing, content-based indexing, music speech 
classification, neural networks 



13 Posters: Boosting SVM classifiers by ensemble Q 
Yan-Shi Dong, Ke-Song Han 

May 2005 Special interest tracks and posters of the 14th international conference on 
World Wide Web WWW '05 

Publisher: ACM Press 

Full text available: ^|pdf (1 17.51 KB) Additional Information: full cit a tion, abstract , references , in dex t erms 

By far, the support vector machines (SVM) achieve the state-of-the-art performance for 
the text classification (TC) tasks. Due to the complexity of the TC problems, it becomes a 
challenge to systematically develop classifiers with better performance. We try to attack 
this problem by ensemble methods, which are often used for boosting weak classifiers, 
such as decision tree, neural networks, etc., and whether they are effective for strong 
classifiers is not clear. 

Keywords: classifier design and evaluation, information filtering, machine learning, neural 
nets, text processing 




14 A constructive algorithm for neural networks that g eneralize Q 
Alvin Surkan, Colin Campbell 

January 1998 ACM SIGAPL APL Quote Quad , Proceedings of the conference on Share 

knowledge share success APL '97, Volume 28 issue 4 
Publisher: ACM Press 

Full text available: Q pdf(650.04 KB) Additional Information: full citation, abgract, index terms 

APL functions were designed to describe a constructive algorithm that synthesizes a neural 
network while optimizing its ability to generalize. Algorithms are implemented in programs 
to discover networks of binary weights that assign unfamiliar, high-dimension binary 
patterns to their most similar classes. Constructive algorithms that create networks are 
important for the design of classifiers based on array-processors made from fast two-level 
circuits. APL is an effective tool for the exposition ... 

15 Connectionist temporal classification: labelling unsegmented sequence data with Q 
recurrent neural networks 

Alex Graves, Santiago Fernandez, Faustino Gomez, Jurgen Schmidhuber 
June 2006 Proceedings of the 23rd international conference on Machine learning 

ICML ( 06 
Publisher: ACM Press 

Full text available: Q pdf(298.26 KB) Additional Information: full citation , abstract , references , index terms 

Many real-world sequence learning tasks require the prediction of sequences of labels from 
noisy, unsegmented input data. In speech recognition, for example, an acoustic signal is 
transcribed into words or sub-word units. Recurrent neural networks (RNNs) are powerful 
sequence learners that would seem well suited to such tasks. However, because they 
require pre-segmented training data, and post-processing to transform their outputs into 
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label sequences, their applicability has so far been limited ... 

16 A context-aware mobile service discovery and selection mechanism using artificial 

^ neural networks 

^ Eyhab Al-Masri, Qusay H. Mahmoud 

August 2006 Proceedings of the 8th international conference on Electronic 

commerce: The new e-commerce: innovations for conquering current 
barriers, obstacles and limitations to conducting successful business on 
the internet ICEC 06 
Publisher: ACM Press 

Full text available: ^| pdf(447.28 KB) Additional Information: full citation , abstract , references , index terms 

In this paper we present SmartCon, a context-aware system for the discovery and 
selection of mobile services using Artificial Neural Networks (ANNs). The solution we have 
developed is a mobile agent-enabled system that adaptively and iteratively learns to select 
the best available mobile service derived from the extraction of a series of features 
utilizing contextual information such as the Composite Capabilities/Preferences Profile 
(CC/PP), service-specific, and non-uniform user-specific featur ... 

Keywords: CC/PP, artificial neural networks, backpropagation, mobile services, service 
discovery, service selection, software agents 



17 A simple approach to building ensembles of Naive Bayesian classifiers for word Q 

sense disambiguation 
Ted Pedersen 

April 2000 Proceedings of the first conference on North American chapter of the 
Association for Computational Linguistics 

Publisher: Morgan Kaufmann Publishers Inc. 

Full text available:^ pdf(677. 21 KB ) Additional Information: full citation, abstract, references, citings 

This paper presents a corpus-based approach to word sense disambiguation that builds an 
ensemble of Naive Bayesian classifiers, each of which is based on lexical features that 
represent co-occurring words in varying sized windows of context. Despite the simplicity of 
this approach, empirical results disambiguating the widely studied nouns line and interest 
show that such an ensemble achieves accuracy rivaling the best previously published 
results. 

18 A sub Bayesian nearest prototype neural network with fuzzy interpretability for Q 
diagnosis problems 

Saman Halgamuge, Christoph Grimm, Manfred Glesner 

February 1995 Proceedings of the 1995 ACM symposium on Applied computing SAC 
95 

Publisher: ACM Press 

Full text available: *g| pdf(508.72 KB) Additional Information: full citation , references , citings, index terms 




Keywords: Bayes classifier, fuzzy rules, neural networks, rule generation 



19 Pattern-based fault diagnosis using neural networks 
W. E. Dietz, E. L Kiech, M. Ali 

June 1988 Proceedings of the 1st international conference on Industrial and 

engineering applications of artificial intelligence and expert systems - 
Volume 1 IEA/AIE 88 

Publisher: ACM Press 

IIA . A . H Additional Information: full citation, abstract, reference_s, citings, index 

Full text available: TO pdf(1 .01 MB) ; ~ — * 

terms 

The detection and diagnosis of faults in real time are active areas of research in 
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knowledge-based expert systems. Several methods of diagnosis have been applied to a 
variety of physical systems. Rule-based approaches have been applied successfully to 
some domains. However, encoding knowledge in rule bases raises many difficult 
knowledge acquisition issues; in addition, rule-based systems are often too slow to be 
effectively applied in a real-time environment. More advanced diagnostic syste ... 

20 Security: Decision tree classifier for network intrusion detection with GA-based 

feature selection 
^ Gary Stein, Bing Chen, Annie S. Wu, Kien A. Hua 

March 2005 Proceedings of the 43rd annual Southeast regional conference - Volume 

2 ACM-SE 43 
Publisher: ACM Press 

Full text available: g pdf(103 MB) Additional Information: full citation , abstract , references , index terms 

Machine Learning techniques such as Genetic Algorithms and Decision Trees have been 
applied to the field of intrusion detection for more than a decade. Machine Learning 
techniques can learn normal and anomalous patterns from training data and generate 
classifiers that then are used to detect attacks on computer systems. In general, the input 
data to classifiers is in a high dimension feature space, but not all of features are relevant 
to the classes to be classified. In this paper, we use a gene ... 

Keywords: decision trees, genetic algorithm, intrusion detection 
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21 Learning to crawl: Comparing classification schemes 
Gautam Pant, Padmini Srinivasan 

October 2005 ACM Transactions on Information Systems (TOIS), volume 23 issue 4 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , index terms . 
review 



Full text available: 



Topical crawling is a young and creative area of research that holds the promise of 
benefiting from several sophisticated data mining techniques. The use of classification 
algorithms to guide topical crawlers has been sporadically suggested in the literature. No 
systematic study, however, has been done on their relative merits. Using the lessons 
learned from our previous crawler evaluation studies, we experiment with multiple 
versions of different classification schemes. The crawling process is ... 



Keywords: Topical crawlers, classifiers, focused crawlers, machine learning 



22 Nonlinear Boosting Projections for Ensemble Construction Q 
Nicolas Garcia-Pedrajas, Cesar Garcia-Osorio, Colin Fyfe 

May 2007 The Journal of Machine Learning Research, Volume 8 
Publisher: MIT Press 

Full text available: ^pdf(10.56 MB) Additional Information: full citation , abstrac t 

In this paper we propose a novel approach for ensemble construction based on the use of 
nonlinear projections to achieve both accuracy and diversity of individual classifiers. The 
proposed approach combines the philosophy of boosting, putting more effort on difficult 
instances, with the basis of the random subspace method. Our main contribution is that 
instead of using a random subspace, we construct a projection taking into account the 
instances which have posed most difficulties to previous ... 

23 Statistical Comparison s of Classifiers over Mu l t iple Data Sets Q 
Janez Demsar 

December 2006 The Journal of Machine Learning Research; volume 7 
Publisher: MIT Press 

Full text available: ^[pdf(316.96 KB) Additional Information: f u l l c i tat ion, a b st ra c t 

While methods for comparing two learning algorithms on a single data set have been 
scrutinized for quite some time already, the issue of statistical tests for comparisons of 
more algorithms on multiple data sets, which is even more essential to typical machine 
learning studies, has been all but ignored. This article reviews the current practice and 
then theoretically and empirically examines several suitable tests. Based on that, we 
recommend a set of simple, yet safe and robust non-parametri ... 
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24 A seq uential alg orithm for trainin g text classifiers Q 
David D. Lewis, William A. Gale 

August 1994 Proceedings of the 17th annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '94 

Publisher: Springer-Verlag New York, Inc. 

Full text available: ffl pdf(902.60 KB) Additional Information: full citation , references , citings, index terms 



25 Proximal support vector machine classifiers □ 

Glenn Fung, Olvi L. Mangasarian 
v August 2001 Proceedings of the seventh ACM SIGKDD international conference on 
Knowledge discovery and data mining KDD '01 
Publisher: ACM Press 

p_ ii a x , U1 £i ,, /7C , c -, Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf (755.57 KB ) - 

^ " terms 

Instead of a standard support vector machine (SVM) that classifies points by assigning 
them to one of two disjoint half-spaces, points are classified by assigning them to the 
closest of two parallel planes (in input or feature space) that are pushed apart as far as 
possible. This formulation, which can also be interpreted as regularized least squares and 
considered in the much more general context of regularized networks [8, 9], leads to an 
extremely fast and simple algorithm for generating a li ... 

Keywords: data classification, linear equations, support vector machines 

26 Special issue on word sense disambi g uation: Disambi g uatin g highly ambiguous Q 
words 

Geoffrey Towell, Ellen M. Voorhees 

March 1998 Computational Linguistics, volume 24 issue l 

Publisher: MIT Press 

Full text available:^ MA AO |f| 

]g.pdf(1.48 MB) ^ Additional Information: full citation , abstract , references , citings 

Publisher Site 

A word sense disambiguator that is able to distinguish among the many senses of common 
words that are found in general-purpose, broad-coverage lexicons would be useful. For 
example, experiments have shown that, given accurate sense disambiguation, the lexical 
relations encoded in lexicons such as WordNet can be exploited to improve the 
effectiveness of information retrieval systems. This paper describes a classifier whose 
accuracy may be sufficient for such a purpose. The classifier combines the ... 

27 Learning classifier systems and other genetics-based machine learnin g : Kernel- Q 
<H> based, ellipsoidal conditions in the real-valued XCS classifier system 

^ Martin V. Butz 

June 2005 Proceedings of the 2005 conference on Genetic and evolutionary 
computation GECCO '05 

Publisher: ACM Press 

r- „ . * , Ul « ., H cc ,, m Additional Information: full citation, abstract, references, citings, index 

Full text available: TOpdf(1,66 MB) — *~ 

^ terms 

Many learning classifier system (LCS) implementations are restricted to the binary 
problem realm. Recently, the XCS classifier system was enhanced to be able to handle 
real-valued inputs among others. In the real-valued enhancement, XCSF applies as a 
function approximation system that partitions the input space in hyperrectangular 
subspaces specified in the classifiers. This paper changes the classifier conditions to 
hyperspheres and hyperellipsoids and investigates the consequent performance im ... 

Keywords: GAs, XCS, function approximation, learning classifier systems, piece-wise 
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28 A doptin g Wildlife Expe riments f or Web Evolution Estimations : The Role of an Al Web Q 
Page Classifier 

Ioannis Anagnostopoulos, Photis Stavropoulos 

December 2006 Proceedings of the 2006 IEEE/ WIC/ ACM International Conference on 
Web Intelligence WI '06 

Publisher: IEEE Computer Society 

Full text available: "g| pd f(200.03 KB ) Additional Information: full citation , abstract 

This paper proposes a statistical approach for estimating the evolution of web pages in 
directories. The proposal is based on the capture-recapture method used in wildlife 
biological studies in an animal, bird or fish populations, and it is modified according to the 
necessary assumptions and amendments for applying the experiments in a search engine 
directory. During these experiments, web pages are considered as animals and the specific 
types of web pages as particular species of animals whose ... 

29 Learnin g Classifier systems an d other genetics- based m ach in e learning: papers: Q 
Classifier prediction based on tile coding 

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wilson, David E. Goldberg 
July 2006 Proceedings of the 8th annual conference on Genetic and evolutionary 

computation GECCO '06 
Publisher: ACM Press 

Full text available: Q pdf(489.82 KB) Additional Information: full citation , abstract , references , index terms 

This paper introduces XCSF extended with tile coding prediction: each classifier 
implements a tile coding approximator; the genetic algorithm is used to adapt both 
classifier conditions (i.e., to partition the problem) and the parameters of each 
approximator; thus XCSF evolves an ensemble of tile coding approximators instead of the 
typical monolithic approximator used in reinforcement learning. The paper reports a 
comparison between (i) XCSF with tile coding prediction and (ii) plain tile coding ... 

Keywords: LCS, RL, XCS, tile coding 




30 Poster pa pers: Evaluati ng class ifiers' perf ormance in a constrained env ironm ent Q 
Anna Olecka 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 
Knowledge discovery and data mining KDD '02 

Publisher: ACM Press 

Full text available: Q_pdf(651J.6 KB) Additional Information: full citation, abstract, re f er en c es , index terms 

In this paper, we focus on methodology of finding a classifier with a minimal cost in 
presence of additional performance constraints. ROCCH analysis, where accuracy and cost 
are intertwined in the solution space, was a revolutionary tool for two-class problems. We 
propose an alternative formulation, as an optimization problem, commonly used in 
Operations Research. This approach extends the ROCCH analysis to allow for locating 
optimal solutions while outside constraints are present. Similarly to ... 

31 Multiclass Cancer Classification Using Semisupervised Ellipsoid ARTMAP and Q 
Particle Swarm Optimization with Gene Expression Data 

Rui Xu, Georgios C. Anagnostopoulos, Donald C. Wunsch 

January 2007 IEEE/ACM Transactions on Computational Biology and Bioinformatics 

(TCBB), Volume 4 Issue 1 
Publisher: IEEE Computer Society Press 

Full text available: ^)pdf( 3 .7 0 MB) Additional Information: full citation, abstr act, references, index terms 

It is crucial for cancer diagnosis and treatment to accurately identify the site of origin of a 
tumor. With the emergence and rapid advancement of DNA microarray technologies, 
constructing gene expression profiles for different cancer types has already become a 
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promising means for cancer classification. In addition to research on binary classification 
such as normal versus tumor samples, which attracts numerous efforts from a variety of 
disciplines, the discrimination of multiple tumor types is ... 

Keywords: Cancer classification, gene expression profile, semisupervised ellipsoid 
ARTMAP, particle swarm optimization. 
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We propose adaptive testing as a general mechanism for extracting information about 
stimuli from spike trains. Each test or question corresponds to choosing a neuron and a 
time interval and checking for a given number of spikes. No assumptions are made about 
the distribution of spikes or any other aspect of neural encoding. The chosen questions are 
those which most reduce the uncertainty about the stimulus, as measured by entropy and 
estimated from stimulus-response data. Our experiments are bas ... 
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We propose a maximal figure-of-merit (MFoM)-learning approach for robust classifier 
design, which directly optimizes performance metrics of interest for different target 
classifiers. The proposed approach, embedding the decision functions of classifiers and 
performance metrics into an overall training objective, learns the parameters of classifiers 
in a decision-feedback manner to effectively take into account both positive and negative 
training samples, thereby reducing the required size of pos ... 

Keywords: Text categorization, decision tree, generalized probabilistic descent method, 
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Small-form-factor, low-power wireless sensors-motes-are convenient to deploy, but lack 
the bandwidth to capture and transmit raw high-frequency data, such as human voices or 
neural signals, in real time. Local filtering can help, but we show that the right filter 
settings depend on changing ambient conditions and network effects such as congestion, 
which makes them dynamic and unpredictable. Mote collection systems for high-frequency 
data must support iteratively-tuned, deployment-specific filte ... 

Keywords: acoustics, health monitoring, motes, sensor networks, signal processing 
frameworks 
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This paper addresses a very specific problem that happens to be common in health science 
research. We present a machine learning based method for identifying patients diagnosed 
with congestive heart failure and other related conditions by automatically classifying 
clinical notes. This method relies on a Perceptron neural network classifier trained on 
comparable amounts of positive and negative samples of clinical notes previously 
categorized by human experts. The documents are represented as fea ... 
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The three corpus-based statistical sense resolution methods studied here attempt to infer 
the correct sense of a polysemous word by using knowledge about patterns of word 
cooccurrences. The techniques were based on Bayesian decision theory, neural, networks, 
and content vectors as used in information retrieval. To understand these methods better, 
we posed a very specific problem: given a set of contexts, each containing the noun line in 
a known sense, construct a classifier that selects t ... 
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In corporate data mining applications, cost-sensitive learning is firmly established for 
predictive classification algorithms. Conversely, data mining methods for regression and 
time series analysis generally disregard economic utility and apply simple accuracy 
measures. Methods from statistics and computational intelligence alike minimise a 
symmetric statistical error, such as the sum of squared errors, to model ordinary least 
squares predictors. However, applications in business elucidate that ... 

Keywords: asymmetric costs, cost-sensitive learning, data mining, neural networks, time 
series analysis 
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Use of semantic content is one of the major issues which needs to be addressed for 
improving image retrieval effectiveness. We present a new approach to classify images 
based on the combination of image processing techniques and hybrid neural networks. 
Multiple keywords are assigned to an image to represent its main contents, i.e. semantic 
content. Images are divided into a number of regions and colour and texture features are 
extracted. The first classifier, a self-organising map (SOM) clusters ... 

Keywords: content-based image retrieval, image indexing/classification, neural networks 
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An adaptive semi-supervised ensemble method, ASSEMBLE, is proposed that constructs 
classification ensembles based on both labeled and unlabeled data. ASSEMBLE alternates 
between assigning "pseudo-classes" to the unlabeled data using the existing ensemble and 
constructing the next base classifier using both the labeled and pseudolabeled data. 
Mathematically, this intuitive algorithm corresponds to maximizing the classification 
margin in hypothesis space as measured on both the labeled and unlabel ... 

Keywords: boosting, classification, ensemble learning, semi-supervised learning 
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This paper presents an attribute clustering method which is able to group genes based on 
their interdependence so as to mine meaningful patterns from the gene expression data. It 
can be used for gene grouping, selection, and classification. The partitioning of a relational 
table into attribute subgroups allows a small number of attributes within or across the 
groups to be selected for analysis. By clustering attributes, the search dimension of a data 
mining algorithm is reduced. The reduction of ... 

Keywords: Data mining, attribute clustering, gene selection, gene expression 
classification, microarray analysis. 
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This paper demonstrates the substantial empirical success of classifier combination for the 
word sense disambiguation task. It investigates more than 10 classifier combination 
methods, including second order classifier stacking, over 6 major structurally different 
base classifiers (enhanced Naive Bayes, cosine, Bayes Ratio, decision lists, transformation- 
based learning and maximum variance boosted mixture models). The paper also includes 
in-depth performance analysis sensitive to properties ... 
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Bagging and boosting are two popular ensemble methods that typically achieve better 
accuracy than a single classifier. These techniques have limitations on massive data sets, 
because the size of the data set can be a bottleneck. Voting many classifiers built on small 
subsets of data ("pasting small votes") is a promising approach for learning from massive 
data sets, one that can utilize the power of boosting and bagging. We propose a 
framework for building hundreds or thousands of such classifie ... 
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Selection of machine learning techniques requires a certain sensitivity to the requirements 
of the problem. In particular, the problem can be made more tractable by deliberately 
using algorithms that are biased toward solutions of the requisite kind. In this paper, we 
argue that recurrent neural networks have a natural bias toward a problem domain of 
which biological sequence analysis tasks are a subset. We use experiments with synthetic 
data to illustrate this bias. We then demonstrate that thi ... 

Keywords: Index Terms- Machine learning, neural network architecture, recurrent neural 
network, bias, biological sequence analysis, motif, subcellular localization, pattern 
recognition, classifier design. 



53 Learning c lassifier s ystems a nd o ther genet ics- ba s ed machine lear ning: XCS with 

<£> com puted prediction i n m ul t istep environments 

^ Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wilson, David E. Goldberg 

June 2005 Proceedings of the 2005 conference on Genetic and evolutionary 
computation GECCO '05 

Publisher: ACM Press 

Full text available- f 51 ! pdf(324 08 KB) Additional Information: full citation , abstract , references , citings , index 
' ™ 1 terms 

XCSF extends the typical concept of learning classifier systems through the introduction of 
computed classifier prediction. Initial results show that XCSF's computed prediction can be 
used to evolve accurate piecewise linear approximations of simple functions. In this paper, 
we take XCSF one step further and apply it to typical reinforcement learning problems 
involving delayed rewards. In essence, we use XCSF as a method of generalized (linear) 
reinforcement learning to evolve piecewise linear a ... 

Keywords: LCS, RL, XCS, generalization 
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The purpose of this paper is to provide a PAC error analysis for the q-norm soft margin 
classifier, a support vector machine classification algorithm. It consists of two parts: 
regularization error and sample error. While many techniques are available for treating the 
sample error, much less is known for the regularization error and the corresponding 
approximation error for reproducing kernel Hilbert spaces. We are mainly concerned about 
the regularization error. It is estimated for gener ... 

55 MetaCost: a general method for making clas sif iers c os t-sen s iti ve Q 
Pedro Domingos 

August 1999 Proceedings of the fifth ACM SIGKDD international conference on 
Knowledge discovery and data mining KDD '99 

Publisher: ACM Press 

Full text available: Q pdf(1.21 MB) Additional Information: full citation , references , citings , index terms 




http://portal.acm.org/resultsxfm?que^ 7/9/2007 



Results (page 3): neugent and (classifier or neural) Page 5 of 6 

56 Artificial neural network models for texture classification via: the Radon transform Q 
A. D. Kulkarni, P. Byars 

March 1992 Proceedings of the 1992 ACM/SIGAPP symposium on Applied computing: 
technological challenges of the 1990's SAC '92 

Publisher: ACM Press 

Full text available: Qpdf (448.54 KB) Additional Information: full citation , references , index terms 



57 Data streams: A self-o rganizing neural n etwork for detecting novelties 
Marcelo Keese Albertini, Rodrigo Fernandes de Mello 

March 2007 Proceedings of the 2007 ACM symposium on Applied computing SAC '07 
Publisher: ACM Press 

Full text available: ^£|pdf( 158.30 KB) Additional Information: full citation , abstract , references , index terms 

In order to detect new events, a system must support on-line learning, adapting to pattern 
dynamic characteristics. Studies of such adaptation have originated the novelty detection 
area, which aims at identifying unexpected or unknown patterns. These researches have 
motivated this work to propose the on-line and unsupervised Self-Organizing Novelty 
Detection (SONDE) neural network. In this network, the creation of new neurons points 
out novelties. Experiments evaluated the influence of ... 
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This paper focuses on the use of code features for automatic plagiarism detection. Instead 
of the text-based analyses employed by current plagiarism detectors, we propose a 
system that is based on properties of assignments that course instructors use to judge the 
similarity of two submissions. This system uses neural network techniques to create a 
feature-based plagiarism detector and to measure the relevance of each feature in the 
assessment. The system was trained and tested on assignments fro ... 

Keywords: cheating, features, introductory computer science, neural networks, 
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Named Entity (NE) recognition is a task in which proper nouns and numerical information 
are extracted from documents and are classified into categories such as person, 
organization, and date. It is a key technology of Information Extraction and Open-Domain 
Question Answering. First, we show that an NE recognizer based on Support Vector 
Machines (SVMs) gives better scores than conventional systems. However, off-the-shelf 
SVM classifiers are too inefficient for this task. Therefore, we present am... 
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